Goto

Collaborating Authors

 jp 0


Minimax optimal transfer learning for high-dimensional additive regression

arXiv.org Machine Learning

Many human tasks benefit from prior experience when that experience is related to the task at hand. This phenomenon, whereby knowledge from previous tasks is transferred to new ones, has motivated the machine learning technique known as transfer learning. From a statistical perspective, consider the problem of analyzing a regression relationship when the available data are limited. Transfer learning (Torrey and Shavlik (2010)), one of the most widely used techniques in machine learning, can provide a solution. In this framework, one typically leverages related estimates obtained from large but non-identically distributed auxiliary samples, and then refines these estimates to obtain improved estimators from the smaller target sample. Transfer learning has been shown to be effective in a wide range of real-world applications, including computer vision (Kolesnikov et al. (2020); Bu et al. (2021)), natural language processing (Lee et al. (2020); Yuan et al. (2020)), and bioinformatics (Vorontsov et al. (2024); Gao and Cui (2020)), among others. Recently, the theoretical properties of transfer-learned estimators have been extensively investigated across a range of statistical problems.


Risk-Averse Stochastic Shortest Path Planning

arXiv.org Artificial Intelligence

We consider the stochastic shortest path planning problem in MDPs, i.e., the problem of designing policies that ensure reaching a goal state from a given initial state with minimum accrued cost. In order to account for rare but important realizations of the system, we consider a nested dynamic coherent risk total cost functional rather than the conventional risk-neutral total expected cost. Under some assumptions, we show that optimal, stationary, Markovian policies exist and can be found via a special Bellman's equation. We propose a computational technique based on difference convex programs (DCPs) to find the associated value functions and therefore the risk-averse policies. A rover navigation MDP is used to illustrate the proposed methodology with conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) coherent risk measures.